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ABSTRACT 


The  Information  Processing  Techniques  Program  sponsored 
by  DARPA  at  Lincoln  Laboratory  consists  of  three  efforts: 
Packet  Speech,  Acoustic  Convolvers,  and  Airborne  Command 
and  Control.  In  this  Semiannual  Technical  Summary,  the  first 
two  areas  are  reported  in  Vol.  I and  the  third  in  Vol.  II. 
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INFORMATION  PROCESSING  TECHNIQUES  PROGR  AM 


I.  PACKET  SPEECH 


A.  OVERALL  AIM  OF  PACKET  SPEECH  PROGRAM 

The  long-range  objective  of  the  Packet  Speech  Program  is  to  develop  and  demonstrate 
tecliniques  for  efficient  digital  speech  communication  on  networks  suitable  for  boUi  voice  and 
data,  leading  eventually  to  better  interaction  between  people  and  computers  in  a multicomputer 
network  envii o.une.a.  Ttu  IiSC  t»-  B.^vanta^’us  of  messa.-e- 

switched  data  network..  Presently,  th.  sponsored  Network  Si-cerh  Compression  (NSCi 

group  is  studying  the  effects  of  the  network  on  digital  speech  transmission.  Ultimately,  devel- 
opment of  a successful  voice-data  network  will  require  application  of  many  of  the  results  of  the 
past  half-century  of  speech  research  (drawing  upon  such  topics  as  speech  recognition,  speech 
syi  ttA’sJ - sjidwidth  compression,  and  speaker  authentication)  for  effective  digital 
speech  communication.  Current  efforts  of  the  NSC  group  are  strongly  oriented  toward  adapta- 
tion of  the  ARPANET  to  fulfill  voice  needs  and  the  problems  of  speech  bandwidth  compression. 


B.  ACTIVITIES  OF  THE  NSC  GROUP 

A major  joint  activity  of  the  NSC  group  has  been  the  development  of  a packet  speech  capa- 
bility. Of  the  eight  NSC  contractors,  Lincoln.  Information  Sciences  Institute  (ISI),  Stanford 
Research  Institute  (SRI),  Speech  Communication  Research  Laboratory  (SCRL),  and  Bolt, 

Beranek  and  Newman  (BBN)  have  procured,  via  ARPA.  the  SPS-41  computer  for  use  as  a real- 
time speech  processor.  Each  of  these  contractors  also  has  PDP-ll/40  or  PDP-H/45  faciUties 
so  that  both  speech  algorithms  and  network  protocol  programs  could  be  shared  by  all  the  above 
users.  In  May  and  June  of  1975.  several  partially  successful  speech  demunbtration.  \vbi. 
carried  out  between  ISI  and  a faciUty  at  Lincoln  composed  of  the  fast  digital  processor  (FDP) 
and  the  TX-2.  To  date,  no  reUaole  speecn  communicadca  facility  I.  available  K.  NBC. 
resolution  of  difficulties  encountered  with  processor  hardware  by  ISI  and  SRI. 

C.  SUMMARY  OF  LINCOLN'S  ACTIVITIES  IN  NSC 

Lincoln  has  continued  its  work  on  ARPA  network  measurements.  The  results  of  these 
measurements  should  strongly  influence  tne  methods  of  efficiently  piOCeSsing  speech  packets  to 
obtain  good  speech  quality.  Type  3 packets  [raw  packets  sent  without  RFNMs  (Request  for  Next 
Message)]  suggested  by  Dr.  R.  E.  Kahn  have  proved  to  yield  appreciably  higher  bandwidth  trans- 
mission. As  an  alternate  to  the  SPS-41.  Lincoln  was  asked  by  ARPA  to  construct  two  digital 
voice  terminals  (DVTs)  for  use  as  real-time  speech  processors,  hour  uVTo  have  alrcaJj-  b£on 
constructed  for  other  sponsors  and  two  more  DVTs  will  be  constructed  for  ARPA  during  cal- 
endar 1975.  Lincoln  is  also  developing  a network  speech  faciUty  which  can  interface  with  either 
a DVT  or  an  SPS-41.  When  this  facility  is  operational.  NSC  members  will  be  able  to  talk  to  each 
other  using  either  DVTs,  SPS-41's  or  the  CHI  (Culler-Harrison  Incorporated)  array  processor. 

In  addition  to  the  above.  Lincoln  has  completed  its  work  on  VELP  (voice -excited  linear 
prediction),  is  working  jointly  with  M.I.T.  in  developing  a flexible  audio  front  end,  and  has 
helped  BBN  check  out  their  PITs  (private  Une  interface)  in  a speech  environment. 


1 


i 

, r ■ ^ 


D.  NETWORK  MEASUREMENTS 

. , „ ™„Qoape  service  has  become  available  in  the 
During  this  reporting  perio  , a new  yi  aekets  and  differs  from  regular 

ARPANET.  This  service  makes  use  of  rype  P responsibility  for  sue- 

,T,P.  0,  «.v.ca  i„  .ha.  ».e  no.wor.  ass„»ea  no  J 

cessM  d.Uvony  o,  .oosaagas.  PaC.eta  a.-e  son.  to  L e.reo.  .ha.  har.en 

space  will  be  available  for  the  packets  when  ey  arr^ 

packets  are  discarded  on  "3"  ,re  delivered  to  the  receiving  host  in  the 

sending  or  receiving  host.  Furthei,  jp  P necessarily  the  order  in  which  they 

ordei  of  their  arrival  at  the  destination  IMP,  which  is  not  necessarily 

were  given  to  the  source  Type  0 were  expected  to  be  greater  data-rate 

The  advantages  of  Type  3 messag  y ,„ihpc  tr,  the  user  were 

oapahUi.,  anh  nohuoeh  .„„s..ss.o„  .h~ngh  .ho  n...o-  eon- 

oxpcclod  to  ho  somo  possibUily  ot  los.  moosagos  and  incroased 

h::'oro:'  ::ornl~h  .h.  pon.od  has  hoe„  no„eonnod  «h  ^po  3 _ 
and  the  oomparlson  ot  thoir  pe.-Iormanoa  wl.ii  respeol  .o  y^^ra^luremenl 

no  EFNM  rohtmod  ha  .ho  sanding  hos.  to.  Typo  3 tnoss.gos,  „„s«,onao„.  .och- 

.ociaaigao  .h7ooo;o,a.ion  ;t  a „„o.c  hos.  i„  ,»ning 

„i,„o  »as  J , .„„aodla.ol,  ro.ransmi.s  no.oivod  anossagos  addnig  a .inao  s.anap 

a program  called  ECHO  vl  leh  mi  y received. 

and  a s.g.onoo  nunaho,  andaoa.ang  .1.0  ot  naoasuno- 

using  an  ECHO  pnogram  runn.ng  g,,„3aad  E,  do.aii  in 

ments  of  network  delays  for  a vane  y . . , n Kppn  distributed  to  the  NSC  eommu- 

NSC  Noto  70,  -ARPANET  Do.ay  Moasnroanon.s,-  vah.oh  has  ^ 

„t.y.  Tho  da.,  troan  .ho  „oasu„mon.s  con,ar»  .ho  >.^0  daaan 

::s?:hso,yod  ...h  Typ,  0 .ossagos  .hnonghon.  .ho 

18.5  kbps).  No  Typo  3 naoss.gos  '•'■■'a  '°al.  and  ^ ano.snroanonts 

.he  higher  data  ra.es  (6.8  poreent  at  . t ps  o message  eapabilaty;  nehsork 

wore  taken,  wo  wore  osson.a.Uy  .ho  only  ac.avo  ns.rs  ot  .ho  T7>»  3 

pertormanco  anigh.  ho  dittoron.  as  regards  ...nabor  olmossag.s  los.  a.  Typo  moss  g 
being  used  by  many  hosts  on  the  net  at  the  same  time. 

E.  NETWORK  SPEECH  EXPERIMENTS 

The  TX-2/FDP  program  for  LPC  (linear  predictive  coding)  voeoded  speech  communication 
on  the  ARPANET  was  rrrriTg"er 

“r.::;":::i:eoatotordor. 

allow  the  measurement  of  the  effectiveness  of  the  playout  algo  prog-am 

message  delays  through  the  net  Lismitted 

at  a remote  host  to  send  voeoded  speech  back  to  ^ effectiveness  of  playout 

speech  and  thereby  simplify  the  debugging  of  code  as  well  as  explore 

algorithms. 
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During  the  period,  a number  of  successful  network  conversations  were  held  with  bot 
and  era.  Using  the  ECHO  program  at  ISi.  we  produced  an  analog  tape  of  a ^ 

about  1.4-sec  round-trip  delay  including  smoothing  delays  in  the  playout  \ 

delay  settings  used  for  this  tape,  about  1.5  percent  of  the  speech  packets  arrived  too  to  be 
used  by  the  playout  algorithms  and  consequently  resulted  in  "glitches"  in  the  output  speech.  1 

„L  gmch  ™.e,  whae  .ccable.  Is  „o.  .00  dls«.rbi„g  repreadn.a  . rd.aon.Mo 

target  when  seeking  a compromise  between  round-trip  delay  and  glitch  ra  e. 

N=”lr.  .peech  .xp.pl^pts,  «Moh  had  ror„dr„  used  ,P.  <acl.i.„  al»ed  a.  ™pro™g 

Ih.  playout  .Igortthm  will  be  resamed  tollomne  Imple menl.lion  ot  network  speect  ■>™S-'am 
our  PDP-ll/SPS-41  aystem.  The  Important  decision  which  that  algorithm  must  handle  m when 
o berpla^M  out  speech  alter  a sUent  inter,.,.  11  playout  is  started  too  soon,  a gl.  = . will 
resuiUl"..  pl!yout  catches  up  with  the  llo.  ol  data  Irom  the  net.  „ playout  is  delayed  too  much, 
unnecessary  delay  will  he  introduced  in  the  conversation.  The  algorithm  uecdin  e e r y 
sZol  ourprogLm  (also  in  those  at  ISl  and  Cm,  simply  waited  a lined 
out  alter  the  arrival  ol  the  lirat  speech  packet  lollowlng  a silent  mterval.  Ln  el  eet,  Ih 
packet  was  used  as  an  estimate  ol  the  mean  network  delay  and  the  lined  time  delay  was  added 
,0  allow  lor  the  variance  in  network  delay.  Unlortun.tely,  that  llr.t  packet not  a good  cst 
mate  ot  the  mean,  and  a relatively  large  lined  delay  most  be  used  to  avoid  encess.ve  gl.to  g. 
llTmte  recent  programs  have  used  a running  average  ol  dlllerences  between  arrival  times 
and  a local  clock  to  adjust  the  clock  to  a better  appronimatlon  to  the  mean  networ  e ^ 
acteristlc.  Preliminary  results  show  that  the  overall  delay  can  be  reduced  with  such  an  algo 

rithm  without  increasing  the  glitch  rate. 

F.  DIGITAL  VOICE  TERMINAL  (DVT) 

under  Air  Force  sponsorship,  Lincoln  has  designed  and  consttnCed  a -“e*" jr«e=sor 
capable  ot  pertormlng  a variety  ot  speech  bandwidth  compression  algorithms  The  DVT  i 
programmable  processor  capable  ol  enecuting  18  million  Instmctions/sec.  It 
4 wire-wrapped  boards  containing  a total  ot  470  ECL  (emitter-coupled  logic)  packages 

4 Wire  wiaiJiJc  miorp  ■nvor  fUf  into  a 19- X 5- X 25-inch  frame. 

125  TTL  (transistor-transistor  logic)  packages.  The  D „ „nprtPri 

anH  1C  thiic  self  contained  and  can  be  connected 
This  structure  contains  its  own  power  supply  and  is  thus  sen 

directly  to  a modem  (or  a PDP-11)  for  digital  speech  transmission. 

At  the  present  writing,  the  following  speech  algorithms  have  successfully  run  on  DV 

the  LPC  algorithm  at  2400,  3600,  and  4800  bits  per  second  (bps)  rates;  the  APC  P^"^ 

dictive  coding)  algorithm  at  8000  bps;  two  different  TRIVOC  (triple -functi^  ' 

both  at  2400  and  3600  bps;  and  the  ARC  (adaptive  residual  codmg)  algorithm  9600  ^ 

16  000  bps.  These  algorithms  have  been  developed  over  a period  of  several  months  By 
cltlt  Ihc  single  LPC  algorithm  developed  by  NSC  ,or  the  SPS-41  has  taken  more  than  one 
year  The  p.  imary  reasons  for  this  are  (1)  hardware  problems  in  the  SPS-44  and  (2,  muc 
rein  aas:  oi  pro^gramming  the  DVT.  in  this  latter  connection,  it  is  important  to  no«  at 
the  DVT  is  a highly  sequential  machine  whereas  the  SPS-41  is  a parallel  processor.  By  takmg 
advantage  of  increased  circuit  speeds,  a 55-nsec  instruction  cycle  time  became  feasAle  and  it 
was  determined  tnat  parallel  processing  techniques  were  no  longer  needed  for  real-time  speec 

processing. 

m Holststtcr,  P.E.  BlrnikonshlTM.  L.  Malpass.  and  S.  SensH,  -Vocoder  Imp, -mentations 
» toDiStal  Terminal,-  1975  EASCON  Proceedings, 
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r 1 nvTs  for  nacket  speech  work  on  the  ARPANET 

A future  possible  advantage  of  employing  t pri  at  a civen  data  rate 

K'TK  f thp  nVT  to  rapidly  switch  from  one  program  (say  LP  ) L 
is  the  ability  of  the  DVT  to  rapidly  flexibility  of  bit  rate 

(say  3 600  bps)  to  another  program  sue  ouitable  for  a packet  speech  network 

. spoec.  .Pa«c 

under  spunsoia  jotA  cmrT  6i  Durine  this  con- 

, • • 4 1075  (DVT  5)  and  early  January  197fa  (DVi  o).  uunue 

rr:!:’”;::.  spe.c.  pc  co„p.c.«.  - - - 

presently  existing  DVTs,  given  the  permission  of  the  sponsors 

G deveijdpment  of  a network-speech  facility 

■ Lincoln  participated  in  the  initial 
providing  the  computational  resources  One  aim  of  NS 

computer  facilities  for  network-speec  PDP-11  interfaced  to  the  ARPANET  to 

sists  of  an  SPS-41  to  process  the  speech  signals  and  standardization  of  facilities 

handle  the  flow  of  messages  through  the  network.  Because  of  this  standardizat 

1 4.  r i-uci  T’V-?  T,incoln  is  installing  such  a facility. 

and  the  phase-out  of  the  TX  . a pdp-11/45  with  a 64K  memory  was 

,pc  i„C.U..ioP  o,  .he  so«w„e  .ece.sar, 

consuming.  The  ELF  operating  System  provided  by  SC  ^LF  (VM  or 

which  network- speech  communications  would  be  made.  A P p^^ard  installing  this 

• 4,  1 ^.^nrv  ELF)  is  required  for  the  network-speech  application.  Toward  instaumg  i 

virtual-memory  ELh ) is  requn  successfully  generated  and  in- 

version of  ELF,  a more  primitive,  non-VM,  version  has  been  successfu  y g 

'‘''l^ELriaTklcomponents  necessary  for  normal  operation  by  everyday  users  an  alternate 
AS  ELF  daily  use.  Lincoln  chose  Digital  Equipment 

X7or"^tJn'rRSX-llM  as  its  general-purpose  operating  system.  System 
sembling  and  linking  the  network  software  had  to  be  modified 

SPS-41  system  programs  - an  assembler  and  a debugging  f,Jfhc 

written  for  compatibility  with  the  a 

creation  of  the  overlay  streams  used  by  the  LPC  programs  pqx-UM. 

PDP-10.  These  programs  have  all  been  successfully  modi  le  o 

Once  these  modifications  were  made,  the  SPS  LPC  programs  written  by  and  SRI  w 

Ristalled.  ISPs  modifications  to  SPUD,  wMch  load  these 

, or  processing  „e»erh™ssa,es  is  eh.aioed,  “g  speech 

pation  in  network-speech  experiments,  usmg  either  the 
processor. 
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II.  AUDIO  PRE-PROCESSING  OF  SPEECH  FOR  NSC  USE 

At  an  ARPA  NSC  meeting  held  in  Santa  Barbara  in  December  1974,  Lincoln  agreed  to  work 
on  the  standardization  problem  for  speech  input  to  a digital  processor.  This  pre-processing 
includes  the  choice  of  microphone,  pre -emphasis  filter,  gain  control,  pre -sampling  filter, 

de -emphasis  filter  and  post- sampling  filter. 

Since  an  important  ARPA  long-range  aim  is  the  development  of  remote  voice-data  inter- 
active facilities,  it  seems  appropriate  to  use  speech  inputs  which  are  close  approximations  to 
commercial  equipment.  To  this  end,  Lincoln  proposes  that  present  speech  work  be  done  with 
standard  telephone  handsets  with  the  carbon  button  microphone  replaced  by  dynamic  cartridges. 
Lincoln  has  supplied  other  NSC  members  with  this  equipment.  In  addition,  a model  audio  fron 
end  has  been  designed  at  Lincoln  which  includes  a wide  variety  of  options  for  pluggable  modu  es 
for  the  various  devices  cited  in  the  above  paragraph;  this  subject  is  covered  in  some  detail  m 

NSC  Note  57.  . 

It  has  been  the  experience  of  both  NSC  and  other  speech  bandwidth  compression  worker^ 

that  a wide  variety  of  sampling  rates  are  desired.  Each  time  the  sampling  rate  changes,  the 
pre-sampling  and  post-sampling  filter  cutoff  frequencies  also  should  be  changed.  One  way  of 
accomplishing  this  is  by  plugging  in  a different  analog  filter  each  time.  However,  this  means 
that  any  demand  for  a new  sampling  rate  requires  the  construction  of  a new  analog  filter.  As  a 
step  toward  solving  this  difficulty,  M.I.T.  and  Uncoln  are  collaborating  on  the  construction  o 
a programmable  digital  pre-sampling  filter.  Tb-  design  of  this  filter  is  being  carried  out  by 
M.I.T.  and  the  constnaction  will  be  carried  out  by  Lincoln. 


I.  eVSD  WORK 

During  this  report  period,  the  CVSDM  {continuously  variable  slope  delta  modulation)  wave- 
form encoding  equipment  has  been  used  for  two  ARPA  applications.  One  application  uses  six 
CVSDM  encoder-decoders  driving  a single  PDP-H/45  interface  structure  for  switchboard  and 
conferencing  experiments  on  speech  data  streams  from  six  users.  A second  application  has 
provided  CVSDM  encoder  data  streams  for  use  with  PLI  bit  stream  devices. 

1.  Conferencing  Hardware 

Upon  delivery  of  the  CVSDM  units  (the  end  of  calendar  1974),  six  were  dispatched  to  the 
NSC  group  at  ISI.  During  the  spring  of  this  year,  D.  Cohen's  group  at  ISI  developed  a combmer 
device  designated  the  PBll  which  accepts  data  in  from  and  out  to  several  CVSDM  devices,  com- 
bines them  into  a single  parallel  handshake  for  the  PDP-U/45,  and  provides  each  device  with 
a common  synchronized  clock.  Using  a PBH,  it  will  be  possible  to  combme  six  CVSDMs  (or 
more)  into  a single  PDP-H/45  as  a switchboard  and  use  this  common  node  as  the  basis  for 
multi-user  voice  access  to  the  net  in  both  a single-user  extension  mode  and  a many-user  con- 
ference mode.  The  plan  is  to  reproduce  a PBll  box  at  Lincoln  so  that  the  Lmcoln  PDP-H/ 
can  communicate  with  several  CVSDM  devices  connected  to  it  and  also  communicate  with  ISI, 
through  the  network,  in  several  conferencing  modes  at  CVSDM  rates. 

2.  CVSDM-PLI  Tests 

In  mid-March  1975,  two  CVSDM  units  were  lent  to  BBN  for  use  as  speech  input  devices 
for  PU  development  and  testing.  The  PU,  insofar  as  packet  speech  is  concerned,  looks  like 
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a minihost  capable  of  accepting  and  delivering  serial  speech  data  while  providing  properly  for- 
matted packets  to  a network  IMP.  Working  with  BBN  personnel,  Lincoln  staff  provided  guidance 
for  proper  PLI  speech  software  development  including  buffer  delay  for  net  smoothing  and  silence 
detection  strategy. 

Using  this  software.  PLl-CVSDM  experiments  were  performed  by  BBN  and  voice  transmitted 
tlirough  various  network  connections.  A recording  of  speech  input  and  output  from  experiments 
on  from  one  to  ten  link  patlis  was  made  using  various  CVSDM  rates.  As  expected,  the  longer 
hops  could  not  sustain  the  l6-kbps  rate  using  Type  1 message  format,  although  the  shorter  hops 
sound  fine  (i.e.,  small  delay  and  no  discontinuities).  This  recording  is  on  file  with  Dr.  R.  E.  Kahn. 

J.  VOICE-EXCITED  LINEAR  PREDICTION  VOCODER 

1 . Introduction 

The  previous  SATS  included  a preliminary  description  of  a voice-excited  linear  prediction 
(VELP)  vocoder  with  adaptive  spectral  flattening.  Further  work  on  tliis  vocoding  scheme  has 
resulted  in  an  8-kbps  VELP  system  with  output  speech  quality  roughly  comparable  to  that  of  an 
8-kbps  APC  system.*  In  addition,  VELP  quality  seems  less  degraded  than  APC,  or  than  pitch- 
excited  LPC,  when  the  input  speech  is  of  telephone  quality.  A condensed  description  of  the 
current  VELP  system  foUows;  a more  detailed  description  has  been  prepared  for  publication.^ 

2.  General  System  Description 

An  overall  block  diagram  of  the  VELP  system  is  shown  in  Fig.  I-l.  As  in  a voice-excited 
channel  vocoder,  the  excitation  information  is  transmitted  in  an  unprocessed  (except  for  sampling 
and  quantization)  sub-band  of  the  original  speech.  The  baseband  limits  must  be  chosen  so  that 
at  a minimum,  either  the  voice  fundamental,  or  else  the  second  and  third  harmonics  of  the 
speech,  are  transmitted.  With  this  I'equirement  satisfied,  the  nonlinear  distortion  network  is 
able  to  generate  a waveform  containing  all  the  original  harmonics  of  voiced  input  speech.  The 
spectral  shape  superposed  on  this  regenerated  harmonic  structure  is  quite  arbitrary  and  must 
be  flattened  to  produce  a suitable  excitation  function.  In  the  channel  vocoder  case,  this  was 
accomplished  by  a spectral  flattener  consisting  of  a bank  of  bandpass  filters,  each  followed  by 
a limiter.  Spectral  flattening  in  VELP  is  achieved  by  carrying  out  an  LPC  analysis  of  the  dis- 
torted baseband  signal,  and  passing  this  signal  through  an  inverse  filter  based  on  the  derived 

predictor  coefficients,  as  shown  in  Fig.  1-2. 

The  description  of  excitation  function  generation  has  so  far  been  phrased  in  terms  of 
voiced  sounds,  where  the  problem  is  most  difficult.  To  account  for  unvoiced  sounds,  a small 
amount  of  white  noise  is  added  to  the  distorted  baseband.  This  noise  will  be  essentially  un- 
noticed during  voiced  sounds,  but  will  produce  the  depired  hiss  excitation  for  unvoiced  sounds. 

In  order  that  the  excitation  energy  be  correct  for  both  voiced  and  unvoiced  sounds,  a gain 
parameter  is  included  in  the  flattening  filter  so  that  its  output  energy  is  equal  (on  a frame-by- 
frame  basis)  to  the  energy  in  the  residual  error  signal  of  the  original  speech. 

The  excitaUon  signal  is  passed  through  an  LPC  synthesis  filter  whose  parameters  have 
been  determined  at  the  transmitter  by  LPC  analysis  of  the  original  speech.  The  final  step  is 

Hofstetter,  P.  E.  Blankenship,  M.  L,  Malpass,  and  S.  Seneff.  "Vocoder  Implementations 
on  the  Digital  Voice  Terminal,"  1975  EASCON  Proceedings. 

t C.  J.  Weinstein,  "A  Linear  Prediction  Vocoder  with  Voice  Excitation,"  1975  EASCON 
Proceedings. 
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to  filter  out  the  baseband  part  of  this  synthesized  speech  and  then  add  back  in  the  essentially 
clear  band  of  original  speech  which  was  transmitted  over  the  channel.  This  direct  mclusion  of 
the  transmitted  baseband  signal  in  the  output  speech  is  an  important  positive  feature  of  voice- 
excited  vocoders  in  general,  and  VELP  in  particular. 

3.  Illustration  of  System  Operation 

We  will  now  illustrate  VELP  operation  by  means  of  spectrograms,  spectral  cross  sections, 
and  wavefomi  segments  at  various  points  in  the  system.  The  system  used  in  obtaining  this  data 
has  an  overall  bit  rate  of  8 kbps,  LPC  of  8th  order  both  for  vocal  tract  analysis,  and  for  flattening, 
and  a distortion  network  consisting  of  an  asymmetrical  full-wave  rectifier  followed  by  a differ- 
encer.  Complex  sampling  is  used  on  a baseband  extending  approximately  from  150  to  750  Hz, 
and  coding  of  the  baseband  is  5-bit  log  PCM.  The  actual  rate  for  the  baseband  is  5950  bps. 
representing  595  complex  samples/sec.  With  the  spectral  data,  the  precise  overall  bit  rate  is 

thus  8072  bps. 

Wideband  spectrograms  of  the  input  speech  and  VELP  output  for  the  utterance,  "We've 
changed  the  measures,"  spoken  by  a male  talker,  are  shown  in  Fig.  1-3.  It  is  seen  that  the 
gci  eral  spectral  shape  is  fairly  well  maintained.  Probably  the  most  visible  effects  of  VELP 
are  the  irregularity  of  the  pitch  striations  and  the  generally  noisy  appearance  of  the  output 
speech  spectrum.  These  effects  are  perceived  as  a slight  hoarseness  in  the  output  speech. 

The  explanation  seems  to  be  that  the  process  of  regenerating  and  flattening  the  speech  harmonics 
leads  to  an  excitation  spectrum  which  is  not  as  clean  as  would  be  desired,  in  that  the  spectral 
peaks  representing  the  harmonics  are  broadened,  and  the  ratio  of  the  level  of  the  harmonics  to 
the  spectral  hash  between  harmonics  is  lowered.  This  effect,  and  other  aspects  of  system 
operation  are  more  clearly  seen  in  the  spectral  cross  section  pictures  of  Figs.  I-4(a-f),  which 
are  taken  during  the  vowel  in  the  word  "changed,"  as  indicated  by  the  markers  under  the 
spectrograms. 

In  each  of  the  Figs.  I-4(a-f),  a spectral  cross  section  is  shown  above  the  waveform  segment 
analyzed.  The  pictures  represent  measurements  at  points  labeled  a through  f in  Fig.  I 1. 

Each  spectrum  analysis  was  carried  out  at  the  same  location  in  time  with  respect  to  the  begin- 
ning of  a digitally  stored  version  of  the  input  utterance.  The  sampling  interval  was  140  ,isec. 
and  spectral  amplitude  values  (linear  scale)  are  displayed  from  0 Hz  to  the  half  sampling  fre- 
quency. 3572  Hz.  Each  spectrum  was  calculated  by  a DFT  (discrete  Fourier  transform)  of  512 
samples  of  hanning-windowed  waveform.  The  spectral  samples  produced  by  the  DFT  are  there- 
fore spaced  by  7144/512  = 14  Hz.  Every  fourth  point  on  the  spectrum  displays  represents  an 
actual  DFT  magnitude  output.  The  remaining  points  were  filled  in  by  linear  interpolation.  The 
spectral  analysis  parameters  were  selected  as  described  in  order  to  provide  a revealing  display 

of  the  harmonic  structures  of  the  various  spectra. 

Figure  I-4(a)  represents  the  digital  input  speech,  produced  by  low-pass  filtering,  pre- 
emphasis, and  sampling  of  the  analog  speech.  The  harmonic  structure  as  well  as  the  spectral 
envelope  shape  (characteristic  of  a nasalized  vowel)  are  apparent.  Figure  I-4(b;  represents 
the  baseband  signal  as  reconstructed  at  the  receiver.  The  2nd.  3rd.  and  4th  harmonics  have 
been  retained  with  fairly  good  fidelity  but  the  combined  effects  of  aliasing  due  to  filter  sidclobes, 
PCM  coding  noise,  and  filter  roundoff  errors  have  introduced  some  small,  undesired  signal 
components  both  inside  and  outside  the  baseband.  The  input  to  the  spectral  flattener  is  shown 
in  Fig.  1-4 (c).  Many  of  the  harmonics  have  been  regenerated,  but  the  amplitude  envelope  of  the 
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harmonics  is  Car  from  Hat.  It  is  apparent  also  that  the  harmonic  structure  is  not  nearly  as 
clean  as  in  the  oi'iginal  speech.  Many  of  the  loW“level  harmonics  are  broadened  and  smeared, 
some  harmonics  appear  to  be  missing,  and  a visible  hash  level  is  superposed  on  the  entire 
spectrum.  Tlic  output  of  the  spectral  I'lattener  is  shown  in  Fig.  I-4(d).  The  larger  variations 
in  spectral  envelope  have  been  removed,  although  sizable  local  variation  among  adjacent  har- 
monics remains.  Such  local  variations  tend  to  cause  important  problems  only  if  tliey  oceur-  in 
the  vicinity  of  a sharp  vocal  tract  resonance.  In  this  example,  and  in  many  other  cases  whicli 
have  been  observed,  the  worst  combination  of  lack  of  excitation  ilatness  plus  strong  vocal  tract 
resonance  occurs  in  the  baseband  itself.  Thus,  if  we  compare  the  synthesis  filter  output  spec- 
trum in  Fig.  I-4(e)  to  the  original  spectrum  of  Fig.  l-4(a),  we  notice  large  spectral  distortion 
in  the  baseband.  For  example,  the  3rd  harmonic  is  more  than  twice  as  large  as  the  2nd  in 
Fig.  l-4(e).  Fortunately,  this  effect,  which  is  quite  important  perceptually,  is  corrected  by 
the  re-introduction  of  the  baseband  signal  into  the  output  speech.  In  the  final  output  shown  in 
Fig.  l-4(f),  we  see  that  the  amplitude  relationship  of  the  2nd  and  3rd  harmonics  is  the  same  as 
m the  original  speech.  Of  course,  the  fundamental  has  been  removed  by  the  baseband  filter  and 
is  missing  in  the  output  speech,  since  the  baseband  elimination  filter  employed  in  this  system 
was  high-pass  at  750  Hz, 

4.  Speech  Quality  Evaluation 

The  following  comments  on  VELP  arc  based  on  informal  listening  tests  and  reflect  the 
opinions  of  several  experienced  listeners.  Detailed  subjective  testing  of  VELP  has  not  been 
carried  out,  but  fairly  extensive  listening  was  possible,  since  the  system  runs  in  real  time  on 
the  FDP  at  Lincoln. 

The  8-kbps  VELP  system  was  compared  against  an  8-kbps  APC  system  and  a 3.5-kbps 
pitch-excited  LPC  system,  each  implemented  on  the  FDP.  For  high-quality  speech  inputs, 

VELP  and  APC  produced  output  speech  of  roughly  similar  quality.  However,  a small  residual 
hoarseness  in  VELP  caused  most  listeners  to  express  a preference  for  APC.  The  LPC  system 
possessed  the  usual  lack  of  naturalness  associated  with  a pitch-excited  system.  The  same  three 
systems  were  compared  on  a recorded  sample  of  speech  which  had  been  transmitted  through  a 
local  dial-up  connection  in  the  public  telephone  system.  Here  the  results  were  somewhat  dif- 
ferent, as  pitch  glitches  became  somewhat  noticeable  in  APC  and  extremely  bothersome  in 
LPC.  The  VELP  quality  was  least  degraded  by  the  telephone  speech,  supporting  the  assertion 
that  voice-excitation  is  relatively  robust  in  this  type  of  environment. 

K.  MISCELLANEOUS 

Our  work  on  variable-rate  coding  algoritlims  for  LPC  has  presently  come  to  a halt.  In  tiie 
previous  SATS,  we  showed  results  cf  our  equal-area  coding  scheme.  We  made  an  attempt  to 
compare  the  efficiency  of  this  scheme  with  that  of  the  rt.-iore  conventional  log-area  coding  scheme 
used  by  NSC.  Our  criterion  of  comparison  was  the  meg'; -squared  difference  between  the  uncoded 
and  coded  reflection  coefficients  for  the  two  schemes.  No  definitive  results  could  be  obtained; 
on  further  reflection  we  concluded  that  only  a comparison  based  on  subjective  speech  tests  would 
be  significant.  What  remains  to  be  done  for  the  speech  compression  community  at  large  is  a 
comprehensive  test  program  aimed  at  determining  the  best  of  a variety  of  proposed  coding 
schemes. 
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In  our  judgment,  LPC  speech  quality  varies  quite  notieeably  among  speakers.  In  particular, 
female  speakers  ai^pear  to  cause  greater  difficulty  than  male  speakers.  In  a preliminary  attempt 
to  understand  tliis  problem,  we  have  modified  an  existing  real-time  LPC  program  so  that  a given 
speaker  of  eategory  male,  female,  or  child  can  be  perceptually  transformed  into  a different 
category.  Based  on  informal  listener  responses,  the  results  have  been  quite  successful.  To 
perform  this  transformation  we  adjust  two  parameters  independently:  the  pitch  and  the  spectrum. 
The  pitch  adjustment  is  easily  carried  out  by  multiplying  the  measured  pitch  by  a scale  factor 
while  the  spectral  adjustment  involves  an  apparent  time  speedup  (such  as  would  be  attained  by 
ehanging  the  playback  speed  of  a tape  recorder)  but  maintaining  the  overall  speech  rate  output 
the  same  as  tlie  input  rate.  If  a simple  nondistortionless  method  of  implementing  this  trans- 
formation eould  be  found,  it  could  result  in  a significant  improvement  in  LPC  quality. 
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II.  ACOUSTOELECTRIC  CONVOLVERS 


A.  INTRODUCTION 


I 


The  Lincoln  Laboratory  effort  on  acoustoelcctric  convolvers  for  packet  radios  is  motivated 
by  a desire  to  increase  the  instantaneous  bandwidth  and  signal-processing  gain  of  continuously 
variable,  pseudorandom-codc  modems.  Collins  Radio  is  currently  building  for  AREA  several 
packet  radio  repeaters.  Lincoln  Laboratory  is  to  supply  the  first  three  acoustoelcctric  con- 
volver subsystems  for  the  repeaters. 

The  convolver  subsystem  is  being  developed  in  two  steps.  An  initial  convolver  prototype 
was  developed  during  this  report  period  which  meets  the  requirements  for  a single-bit  convolver, 
and  two  convolvers  and  a test  circuit  were  built  and  evaluated  at  Lincoln  Laboratory.  The  fin- 
ished units  are  currently  being  transferred  to  Collins  Radio  for  their  evaluation.  We  are  con- 
verging on  a final  specification  for  a convolver  subsystem  for  the  packet  radio  repeaters,  and  the 
current  subsystem  specifications  are  contained  in  this  report.  Also,  the  acoustoelectric  convolver 
technology  is  being  transferred  as  rapidly  as  possible  to  industrial  sources. 

We  are  continuing  the  study  of  rapidly  acquiring  a time -slotted  signal,  with  a continuously 
changing  code  structure,  air  current  plans  are  to  develop  a circuit  which  will  permit  the  rapid 
acquisition  of  an  incoming  differential-phase-shift-keyed  (DESK)  signal. 

We  are  exploring  the  feasibility  of  a coherent  integrator,  a new  device  concept  which  might 
provide  signal  processing  gaiqs  as  large  as  10^  The  coherent  integrator  is  an  outgrowth  of  our 
convolver  technology.  It  consists  of  a block  of  lithium  niobate  with  a strip  of  silicon  which  is 
spaced  a fraction  of  a micrometer  away  from  the  lithium  niobate  surface.  The  silicon  interface 
is  covered  with  a two-dimensional  array  of  Schottky  diodes  with  polysiUcon  overlays.  Some 
initial  results  are  included  in  this  repoxH. 

B.  ACOUSTOELECTRIC  CONVOLVER 

Two  views  of  the  finished  convolver  package  are  shown  in  Fig.  II-l.  The  basic  assembly 
procedures  arc  similar  to  those  described  in  the  previous  SATS."  The  silicon  strip,  which  is 
30  mils  wide  and  inches  long,  has  a backing  of  metal,  and  10 -mil  gold  wires  are  bonded 
to  the  middle  and  ends  of  this  strip.  The  silicon  rests  on,  and  the  wires  pass  through  a block 
of  silicone  potting  gel  (RTV-602).  Just  before  assembly  is  attempted,  the  lithium  niobate  and 
silicon  interfaces  are  coated  with  liquid  collodion.  The  liquid  dries  rapidly,  and  any  dust  on 
the  surfaces  is  captured  in  the  congealed  film.  The  film  is  pulled  off  the  surfaces  Just  prior 
to  assembly  and  the  freshly  cleaned  surfaces  are  brought  into  contact.  The  brass  mounting 
structures  are  bolted  together,  which  compresses  the  RTV  sufficiently  to  apply  0.1 -atm  pressure 

to  the  silicon  relative  to  the  lithium  niobate  spacer-posts. 

The  input  signals  are  at  300  MHz,  and  when  two  10-p.sec-long  signals  convolve  beneath  the 
silicon,  then  the  output  signal  is  at  600  MHz.  Since  the  dielectric  constant  of  LiNb03  is  approx- 
imately 50,  the  1.4-inch  silicon  acts  like  a half -wavelength-long  transmission  line.  The  metal 
ground  planes,  which  are  visible  on  the  surface  of  the  lithium  niobate  [Fig.  Il-l(a)]  are  designed 
to  provide  a characteristic  impedance  of  30  ohms  to  this  Si-LiNbO^  transmission  line.  A con- 
siderable amount  of  distortion  would  be  obtained,  due  to  the  long-line  effects  in  the  output 

^Information  Eroccssing  Techniques  Erogram  Semiannual  Technical  Summary,  Lincoln 
Laboratory,  M.I.T.,  Vol.  I (31  December  1974). 


Fig.  ll-l(a).  Open  convolver  structure  showing  lithium  niobate  and  silicon  strip. 
The  lithium  niobate  has  deposited  on  its  surface  metallic  transducers  and  grtxmd 
planes.  The  silicon  strip  is  on  a pillow  of  KTV  gel.  The  silicon  is  pressed 
against  the  stand-off  spacers  located  in  the  gap  between  ground  planes. 


Fig.  11-1  (b).  Convolver  with  cover  removed.  The  inner  brass  block  contains  the  output 
transformer  in  a central  cavity  and  two  terminations  for  the  silicon  transmission  line  are 
in  two  cavities  at  the  edge.  An  airtight  seal  is  formed  between  the  outer  block  and  the 
cover,  and  the  device  is  coated  with  a sealant.  The  package  housing  is  evacuated  and 
back-filled  with  dry  nitrogen  through  the  entry  hole  located  adjacent  to  the  output  termi- 
nal. Quarter-wave  impedance  inverter  networks  are  attached  to  the  two  input  terminals. 
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circuitry,  unless  care  is  taken  to  terminate  the  ends  of  the  silicon  with  the  characteristic  im- 
pedance of  the  30  ohms.  That  is  why  a wire  is  bonded  to  each  end  of  the  silicon.  The  bonding 
wire  has  a 10-ohm  inductive  reactance  which  is  compensated  with  a 26-pK  capacitor.  The  out- 
put terminal  at  the  center  of  the  silicon  has  a characteristic  impedance  of  15  ohms,  which  is 
connected  to  a 4:1  impedance  transformer.  In  Fig.  ll-l(b)  the  output  transformer  is  visible  in 
the  center  of  the  inner  brass  block,  as  well  as  the  terminations  connected  to  each  end  of  the 
silicon  strip.  The  input  transducers  have  a wide  fractional  bandwidth  of  30  percent,  which  re- 
sults in  an  input  impedance  of  about  300  ohms  of  reactance  in  series  with  50  ohms  of  radiation 
resistance.  A quarter-wave  impedance  inverter  network  is  attached  to  the  input  terminal.  It 
consists  of  a quarter-wavelength-long  cable  with  a cliaracteristic  impedance  of  50  ohms,  a two- 
to-one  impedance  transformer,  and  tunable  inductor  (see  Fig.  Il-l(b))  . The  input  impedance  over 
the  frequency  band  of  interest  is  shown  in  Fig.  ll-2(a).  Notice  that  the  input  impedance  follows 
a constant  K circle  on  the  Smith  chart,  which  is  equivalent  to  a VSWR  of  3.0,  and  a reflection 
loss  of  1 dI3.  The  output  impedance  plot  is  shown  in  Fig.  ll-2lb).  The  residual  reactance  at  the 
output  is  in  part  due  to  stray  reactance  associated  with  the  output  transformer.  Note  that  the 


VSWR  is  3,  as  in  the  input  circuit. 


Fig.  11-2.  (a)  Smith  chart  plot  of  input  impedance.  Frequency  of  interest 

is  from  2 50  to  350  .MHz.  Kffective  VSWR  is  3:1.  (b)  Smith  chart  plot  of  the 
output  terminal  impedance  from  500  to  700  MHz.  Effective  VSWR  is  ap- 
proximately 3:1. 

Four  convolvers  were  built  on  substrates  675,  680,  682.  and  685.  Three  of  the  convolvers 
have  essentially  identical  characteristics.  The  fourth  convolver,  unit  682,  has  an  excess  inser- 
tion loss  of  10  dB,  which  has  been  traced  to  a residue  of  collodion  particles  on  the  crystal  face. 

A critical  test  of  the  uniformity  of  convolvers  is  to  convolve  a short  CW  impulse  with  a long  pulse. 
If  the  convolver  interaction  is  uniform,  then  the  expected  output  signal  should  remain  constant  as 
the  short  impulse  traverses  the  convolver  interaction  region.  Figure  11-3  shows  this  output  for 
unit  685.  A variation  of  ±1  dB  is  obtained  in  this  device.  This  variation  was  traceable  to  a 
residual  impedance  mismatched  at  the  ends  of  the  silicon.  Unit  680.  with  improved  terminations, 
does  not  exhibit  such  fluctuations. 

The  convolution  of  two  CW  pulses  10  psec  long  creates  a characteristic  triangular  waveform. 
The  peak  amplitude  of  this  convolution  signal  as  a function  of  frequency  is  plotted  in  Fig.  11-4  as 
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Fig.  n-4.  CXitput  signal  level  as  a func- 
tion of  input  frequency  for  10-psec  CW 
pulses  at  10  dBm.  Notice  that  the  sig- 
nal is  uniform  to  ±0.5  dB  over  the  100- 
MHz  bandwidth. 


Fig. 11-3.  Convolution  of  a 1. 3 -psec  im- 
pulse with  a 24-psec  pulse.  Notice  the 
variation  of  convolution  signal  as  a func- 
tion of  position.  This  is  due  in  part  to 
imperfectly  terminated  silicon  strips. 


Fig.  II-5.  CXitput  signal  level  at  600  MHz 
in  dBm  vs  the  product  of  the  300-MHz  in- 
put signals  in  dBm.  The  two  input  signals 
are  kept  equal  and  their  logarithms  are 
summed  in  this  figure.  The  1-dB  com- 
pression point  occurs  when  both  input  sig- 
nals are  at  14  dBm. 


a function  of  input  frequency.  Notice  that  the  response  of  the  convolver  is  flat  within  ili  clB 
over  the  100-Mllz  band  of  interest.  The  convolver  is  a nonlinear  element  in  the  sense  that  the 
output  signal  is  proportional  to  the  product  of  the  input  signals.  For  that  reason,  the  output 
amplitude  is  plotted  as  a function  of  the  sum  of  the  log  (in  dBm)  of  the  two  input  signals  inFig.lI-5. 
hi  this  instance,  the  two  inputs  are  kept  equal  and  are  varied  in  step.  Notice  that  the  output  is 
linear  with  respect  to  the  inputs  up  to  input  levels  of  -114  dBm,  where  a deviation  from  linearity 
of  1 dB  is  obtained.  The  output  bandwidth  is  200  Mllz,  and  a reasonable  noise  floor  created  by 
an  amplifier  with  a noise  figure  of  3 dB  is  -88  dBm.  Consequently,  the  available  dynamic  range 
between  this  noise  floor  and  the  1-dB  compression  point  is  50  dB. 

The  principal  spurious  signal  in  the  convolver  is  due  to  the  self-convolution  of  an  input  signal. 

If  a CW  signal  is  entered  at  a terminal,  then  a reflection  which  occurs  at  the  second  terminal  sub- 
sequently convolves  with  the  input.  The  peak  amplitude  of  this  self-convolution  signal  is  plotted 
in  Fig.  ll-b  as  a function  of  input  power  level.  Notice  that  this  self-convolution  signal  decays 
appro.ximately  as  tlie  square  of  the  input  power.  The  self-convolution  of  a coded  signal  is  sub- 
stantially less  than  the  CW  signal  as  a result  of  the  lack  of  correlation  between  the  beginning  and 
the  end  of  a code  sequence.  We  have  found  with  pseudorandom  codes  that  the  self-convolution 
signal  is  about  20  dB  weaker  than  the  self-convolution  of  a CW  signal.  Only  two  data  points 
could  be  observed  because  of  the  e.xcessive  noise  Boor  in  our  measurement  equipment.  Another 
spurious  signal,  the  direct  feed-through  of  the  input  signal  to  the  output  terminal,  is  70  dB  below 
the  input.  However,  this  is  suppressed  readily  with  a high-pass  filter.  No  other  spurious  signals 
were  observed  with  the  convolver. 


UNIT  685 
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Fig.  II-6.  Spurious  output  signal  level 
due  to  double-transit  reflection  vs  input 
power  level.  Spurious  signal  decreases 
as  the  square  of  the  input.  Coded  signals 
do  not  correlate,  consequently  they  pro- 
duce spurious  signals  which  are  20  dB 
weaker  than  CW  signals. 
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Fig.  II-7.  Acoustoeiectric  convolver  test  set  schematic 
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Fig.  II-8.  Block  diagram  of  acoustoeiectric  convolver  test  set,  showing 
electronic  functions. 
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The  performance  characteristics  of  tlie  acoustoclectric  convolvers  arc  summarized  below. 
They  conform  to  the  I'cquiremcnts  tor  the  packet  radio  repeater,  with  the  exception  of  VSWR, 
where  the  requirements  arc  2;1.  A modest  effort  should  correct  this  difficulty. 


Center  frequency 
Bandwidth 

Convolution  interval 
Input  VSWU 
Output  VSWR 

Maximum  reference  power 
Maximum  signal  power 
Max'imum  output  signal 
Convolution  loss 
Maximum  double-transit  signal 


300  MHz 
100  MHz 
10  psec 


+10  dBm 
+10  dBm 
-45  dBm 
-66  dBm 


-70  dBm  {CW)‘  -90  dBm  (coded) 


C.  CONVOLVER  TEST  SET 


An  acoustoelectric  convolver  test  set  was  designed  and  built  by  Lincoln  Laboratory,  for  use 
by  the  Collins  Radio  Company. 

Figure  11-7  is  a simplified  system  block  diagram.  A feedback  shift  register  generates 
a pseudorandom  code.  This  code  is  reversed  through  the  bit  reverser  module.  To  correct  for 
the  time  delay,  the  non-reversed  eode  is  sent  through  a similar  bit-delay  module.  These  pseudo- 
random codes  modulate  the  300-MHz  RF  in  biphase  modulators.  The  biphase  modulated  (0  to  180°) 
signals  are  applied  to  ports  1 and  2 of  the  eonvolver. 

The  two  biphase  eoded  RF  signals  propagate  past  eaeh  other  in  the  eonvolver.  Since  one  sig- 
nal is  a time -reversed  repliea  of  the  other,  an  autoeorrelation  impulse  is  obtained  in  the  con- 
volver. Also,  sinee  both  signals  are  propagating  in  opposite  direetions,  the  eluster  frequency 
of  the  output  is  twice  the  input  frequency,  or  600  Mllz. 

The  600 -MHz  output  is  amplified  and  applied  to  a doubly  balanced  mixer  used  as  a coherent 
detector.  The  local  oscillator  (LO)  comes  from  the  300-MHz  source  which  is  doubled  to  600  MHz 
and  fed  through  a phase  slufter  to  peak  the  detected  response  of  the  coherent  detector.  Tlie  re- 
sulting video  signal  is  amplified  and  displayed  on  a wideband  oscilloscope. 

The  acoustoelectric  convolver  test  set  is  shown  in  more  detail  in  the  block  diagram  in  Fig.  11-8. 
AH  frequencies  are  generated  by  a temperature-compensated  crystal  oscillator  (TCXO).  A 100-MHz, 
-1^ litter  €ouplfc4  lugle  (ECt,)  Guinpatlble  sl^al  is  seiit  to  Uie  loglo  Glroulte  wei-e  It  Is  used  as  tlw 
digital  system  clock.  A second  TCXO  output  at  300  MHz  is  fed  to  a 4-port  splitter.  Two  of  the 
by  wrrrerA  drivers  Cl’OK.iei Is  proa+i , p 

attenuator  which  is  used  to  adjust  the  level  of  that  channel.  The  attenuator  output  is  amplified 
by  a driver  amplifier,  filtered  by  a 300-MHz  bandpass  filter,  and  applied  to  output  port  A. 

Channel  B is  fed  straight  through  the  preamplifier,  driver,  and  filter  to  output  port  B.  Max- 
imum power  of  +23  dBm  is  available  at  output  port  B;  and  since  it  is  run  at  a fixed  level,  it  can 
be  considered  the  reference  channel. 

A tMrd  output  from  the  splitter  is  phase  sliifted,  amplified,  and  doubled.  The  resulting 
600 -MHz  signal,  is  filtered  and  amplified.  This  signal  is  used  as  the  LO  of  the  coherent  detector. 
The  phase  shifter  is  continuously  variable  to  allow  peaking  of  the  coherent  detector. 


> Convolution  loss  = 10  (log  + log  ^gig 


Pout)- 


j,  iki'-f',!,,'.  ' ' 
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The  fourth  output  of  the  power  splitter  is  used  for  diafpiostics  and  troubleshooting  and  is 
normally  terminated  in  SO  ohms. 

The  return  from  the  convolver  is  filtered  by  a 600-MHz  bandpass  filter  and  amplified  by  a 
low-noise  preamplifier.  The  output  of  the  preamplifier  is  applied  to  the  coherent  detector  where 
it  is  mixed  with  a 600-Mllz  LO.  The  resulting  difference  frequency  (a  video  pulse)  is  filtered  by 
a low-pass  filter  and  amplified  by  the  video  amplifier.  The  output  is  displayed  on  a wideband 
oscilloscope. 

The  front  panel  is  illustrated  in  Fig.  II-9.  CTiannel  A HF  Out  and  its  step  attenuator  are  to 
the  riglit,  and  Cliannel  B KF  Out  is  to  the  left.  These  two  ports  are  hooked  to  the  convolver  under 
test.  The  600-Mllz  return  from  the  convolver  is  fed  into  the  Kl'~  In  port.  The  LO  phase  is  ad- 
justed with  the  V adjust  knob  for  maximum  output  on  the  display  once  the  system  is  functioning. 


Fig.  11-9.  Front  prnel  of  acoustoelectric  convolver  test  set. 


Several  toggle  switches  control  the  digital  logic  board.  The  start/stop  switch  allows  either 
a continuously  changing  pseudorandom  code  (START)  or  a fixed  code  (STOn  to  be  selected.  The 
bit  code  and  the  corresponding  switches  allow  the  user  to  select  the  code  length  according  to  the 
following  chart. 

Switch  1 2 4 

0 0 0 8-bit  code  (2**-l)  0 = down 

10  0 11 -bit  code  (2'^-l)  1 = up 

0 1 0 33-bit  code  (2^^-l) 

1 1 0 36-bit  code  (2^^-l) 

0 0 1 N/A 

1 0 1 N/A 

0 1 1 N/A 

111  20 -nsec  square  wave  — for  diagnostics 
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The  v>  reverse  switch  allows  the  biphase  code  to  be  normal  or  inverted,  thus  chanipnii  the 
polarity  of  the  autocorrelation  pulse.  The  code  generator  reset  allows  the  digital  circuits  to  be 
reset  in  the  event  tliat  all  "O's"  are  entered  in  the  code  generator  during  turn-on  or  setup. 

The  acoustoelectric  convolver  test  set  has  been  designed  to  provide  considerable  ver.satility 
and  flexibility.  It  should  aid  in  the  testing  and  analysis  of  the  acoustoelectric  convolvers  cur- 
rently under  development  by  l.incoln  for  packet  radios. 

1).  KKSL'LTS 

At  present,  a considerable  amount  of  distortion  is  observed  in  the  pliasc -modulated  signal 
from  the  test  set,  a typical  code  sequence  contains  chips  which  are  not  the  requisite  <i0  nsec 
long,  and  a considerable  amount  of  amplitude  modulation  is  present  as  well.  These  distortions 
tend  to  produce  energy  in  a sidelobe  following  the  main  correlation  impulse,  which  s on  the 
average  -17  dl!  oelow  the  main  impulse.  This  can  readily  be  seen  in  Fig.  II-10(al  which  shows 
the  correlation  impulse  for  a running  code,  the  spurious  sidelobe,  while  other  sidelobcs  are 
blurred  by  the  code.  A typical  sidelobe  structure  for  a repeated  code  is  shown  in  Fig.  ll-lO(b) 
with  time  sidelobes  as  large  as  —14  dll  below  the  main  impulse. 


Fig.  11-10.  (a)  Correlation  impulse  obtained 

with  a continuously  changing  RF  code.  No- 
tice the  correlation  impulse  has  the  expected 
29-nsec  duration  time,  and  the  time  side- 
lobes  are  more  than  20  dH  below  the  impulse. 
Deviation  from  the  triangular  wave  shape  is 
due  to  bandwidth-limiting  effects  of  input  and 
output  circuits.  A spurious  sidelobe  -17  dB 
below  the  main  impulse  is  caused  by  distor- 
tion present  in  the  input  waveforms,  (b)  Top- 
ical sidelobe  structure  of  repeated  code.  Note 
change  in  time-scale. 


(a) 


(b) 


E.  CONCLUSIONS 

All  principal  design  goals  for  acoustoelectrie  convolvers  have  been  met,  and  the  preliminary 
results  suggest  that  a convolver  should  function  exceptionally  well  ir  systems  with  pseudorandom 
phase-sliift-keyed  modems.  It  is  therefore  recommended  that  aeousfoeleetrie  convolver  sub- 
systems  be  developed  for  packet  radio  repeaters. 

F.  COHERENT  INTEGRATORS 

The  storage  of  a referenee  signal  with  an  array  of  Sehottky  diodes,  and  the  eross-eorrelation 
of  subsequent  signals  with  this  referenee  was  deseribed  in  an  artiele  in  Applied  Physies  Letters 
in  June  1975.*  We  have  modified  this  strueture  to  provide  for  the  eoherent  overlay  of  a sueeession 
of  RF  signals.  In  the  aforementioned  referenee.  Sehottky  diodes  are  contaeted  with  metal  over- 
lays that  aet  as  eapaeitors,  and  an  equivalent  eireuit  of  these  diodes  and  eapaeitors  is  shown  in 
Fig  II-ll,  in  wliieh  R = Sehottky  diode  resistanee.  = Sehottky  diode  eapaeity,  and  - plate 
eapaeitor.  When  the  lehottky  diodes  are  forward  biased  R^C^  = lO"^  see.  and  when  reverse 
biased  R “C  “ = 10'^  see.  If  the  diodes  are  forward  biased  with  a voltage  impulse  which  is  short 
in  time  relative  to  the  RF  period,  then  the  piezoeleetrie  field  assoeiated  with  an  aeoustie  wave 
on  the  adjaeent  surfaee  of  LiNb03  eauses  a eurrent  pulse  to  Row  from  the  diode  eontaet  into  the 
silieon,  whieh  results  in  a eharge  distribution  in  the  diode  array  whieh  eontains  a repliea  of  the 
aeoustie-wave  field.  These  eharges.  and  the  eleetrostatie  fields  assoeiated  with  them,  distort 
the  underlying  earrier  distribution  of  the  silieon.  A subsequent  surfaee-wave  signal  will  eross- 
eorrelate  with  this  earrier  distribution,  and  a eorrelation  signal  appears  at  the  output  terminal. 


Fig.  II-ll.  Schematic  showing  a silicon 
strip  with  a Sehottky  diode  matrix  mounted 
0.2  jim  away  from  the  LiNbOs  surface- 
wave  delay  line.  Also  shown  is  a photo- 
graph of  the  Sehottky  diode  matrix  and  an 
equivalent  circuit  of  one  diode. 


Phase-coherent  integration  of  a succession  of  reference  signals  requires  the  overlay  of  one 
signal  on  another.  This  is  not  possible  with  tliis  Sehottky  diode  circuit,  because  the  previously 
stored  signal  is  lost,  within  a nanosecond  whenever  the  diodes  are  forward  biased. 

=:q<.A.  Ingebrigtsen,  R.  A.  Cohen,  and  R.  W.  Mountain,  Appl.  Phys.  Lett.  (June  1975). 
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We  have  modified  the  Sciiottky  diode  array  by  replacing  the  metallic  overlay  with  a 
polycrystalline-silicon  film  overlay.  This  has  the  effect  of  putting  in  series  with  Cp  a resistor 
R , with  values  consistent  with  RpCp  sec.  In  our  device,  Cp  is  approximately  16  times 

larger  than  C with  zero  or  slightly  reverse -biased  diodes.  Thus,  most  (16/17)  of  the  charge 
on  C is  transferred  to  C in  several  microseconds.  If  the  Schottky  diodes  are  forward  biased 
a second  time,  then  an  additional  charge  is  transferred  from  C^  to  Cp.  If  this  is  done  with  suc- 
cessive pulses  which  overlay  exactly  the  previous  pulses,  then  a large  number  of  signals  are 
coherently  summed  up  to  the  point  where  the  potential  on  t;p  approaches  the  momentary  potential 
on  C .. 

The  charge  leaks  to  the  bulk  silicon  through  the  back-biased  diodes  with  a time  constant 
T = R ”(C  ” + C ),  which  is  approximately  100  msec.  A 2.75-cm  x 20-mm  strip  of  n-type  silicon 
with  a bulk  resistivity  of  30  ohm-cm  was  u.sed.  The  silicon  interface  is  covered  with  a uniform 
array  of  platinum-silicide  Schottky  diodes  each  with  a diameter  of  5 pm.  The  diodes  are  made 
by  etching  5-pm-diameter  holes  in  a 0.1  -pm-thick  oxide.  These  holes  are  overlaid  with  a thin 
layer  of  RF-sputtered  platinum,  and  the  wafer  is  heated  to  600 "C.  Each  diode  is  overlaid  with 
a 10 -mm  x 10 -pm  square  of  poly  silicon  having  a sheet  resistance  of  10  ohms/square.  The 
Schottky  diode  array  is  held  0.2  pm  away  from  the  adjacent  lithium  niobate  with  etched  spacer 
posts  on  the  lithium  niobate  delay  line. 

Since  the  back-biased  resistance  of  the  Schottky  diodes,  and  the  resistance  of  the  polysilicon 
is  quite  high,  the  piezoelectric  field  of  the  surface  wave  terminates  on  charges  in  the  bulk  silicon. 
A 500-V,  5-nsec  voltage  pulse  is  applied  across  the  silicon- Li NbOj  structure  to  forward  bias  the 
Schottky  diodes  a few  tenths  of  a volt  for  a fraction  of  an  RF  period.  TH"  causes  some  of  the 
piezoelectric  field  to  terminate  on  the  diode  contacts,  and  a charge  is  transferred  from  the  bulk 
silicon  which  is  proportional  to  the  instantaneous  local  electric  displacement.  Thus,  a charge 
image  of  the  acoustic  wave  field  plus  a uniform  charge  due  to  the  DC  pulse  is  obtained.  This 
charge  back-biases  the  diodes  and  the  diode  leakage  current  establishes  a decay  rate  of  about 
0.1  sec. 

.Most  of  the  500-V  forward-bias  pulse  appears  across  the  0.75-mm-thick  lithium  niobate 
crystal.  A thinner  crystal  would  require  a proportionally  lower  voltage. 

The  plate  output  voltage  in  Fig.  11-12  is  caused  by  the  correlation  of  a stored  RF  pulse  at  a 
frequency  of  70  MHz  and  a time  duration  of  1.8  psec  with  a similar  reading  pulse.  The  reading 
pulse  occurred  10  msec  after  the  reference  pulse  was  stored.  The  slight  distortion  of  the  triangle 


Fig.  11-12.  Plate  output  signal  resulting 
from  the  correlation  of  a stored  RF  pulse 
at  70  .MHz  and  of  time  duration  1.8  psec 
with  a similar  reading  pulse.  Reading 
pulse  occurred  10  msec  after  reference 
pulse  was  stored. 
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is  caused  by 
put  power  (P 


the  attenuation  of  the  acoustic  signals.  The  efficiency  of  the  device  in  terms  of  out 
),  stored  reference  signal  input  power  (PJ.  and  reading  pulse  power  (P^)  is 


P . - 1'  - 
out  w r 


-83  dBm 


for  a reference  storage  time  of  t msec.  The  correlation  signal  is  linearly  proportional  to  the 
reference  signal  up  to  a power  level  of  30  dBm  and  with  rea^ng  signals  up  to 

The  polycrystalline-silicon  film  of  sheet  resistivity  10  ohms/square  has  associa  cd 
it  a relaxation  time  of  several  microseconds.  This  leaves  any  charge  accumulated  in  the  po  y 
Silicon  unaffected  by  subsequent  8-nsec-long  high-voltage  pulses.  A charge  brought  to  the 
platinum-silicide  dot  by  a high-voltage  pulse  will  require  several  microseconds  to 
the  polysilicon  island.  Since  the  capacity  Cp  is  substantially  larger  than  the  capacity  C ^ the 
Schottky  dot.  most  of  the  charge  ends  up  in  the  polysilicon.  If  the  high-voltage  pulse  is  exactly 
timed  with  respect  to  the  input  signal,  then  the  charge  in  the  polysilicon  should  build  up  in  phase 
with  consecutive  writings,  and  it  should  be  possible  to  coherently  integrate  a signal. 


Fig  1,-13.  Plate  output  signal  during  a coherent 

^:^:^s":rth'’“a^.‘l;;?erpre' pTA^  oc^ufs  every  1.2  sec^  Reading 

decay  of  the  stored  signal. 


Figure  11-13  shows  the  result  of  a coherent  integration  of  an  KF  pulse.  Twelve  consecutive 
S-  a it  sc  Dulses  occur  every  1.2  sec  with  a minimum  interpulse  period  Tp  of  8 msec.  The 
output  amplifier  is  blocked  with  a switch  during  application  of  the  high-voltage  pulse,  which  causes 
r vL,,=.,  »n«  .n  .he  plC«™.  A c-.dl.g  C-s.  ■■■•»  »' 

, 5 X 10V.~  ""PJ-  Tl»  «riU»g  ."0  P“'-« 

With  Fig  n-12.  The  figure  shows  a buildup  of  the  output  signal  during  the  initial  integration  pc 
"d  If  96  msec.  The  lugh-voltage  pulses  are  then  gated  off  to  show  the  decay  of  the  stored  signal. 
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or  Oooa,  roUowo  appoo.^orel,  .o  r' “ 

o,  3.  »=ec,  .heoe.0  .he  =uhse,ue„.  oo—  = ,,,,,  „„e. 

60  msec.  In  this  instance,  the  leveling  o e processor  with  unique  capabilities. 

eonclhsien,  we  have  he»o„e.ra.eh  ...,eee  time  d.ea- 

II  seems  feasible  to  stere  with  this  °"'=®  loj  oI  sevefal  tens 
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